Acoustical Sound Database in Real Environments for Sound Scene Understanding and Hands-Free Speech Recognition
نویسندگان
چکیده
This paper reports on a project for collection of the sound scene data. The sound scene data is necessary for studies such as sound source localization, sound retrieval, sound recognition and hands-free speech recognition in real acoustical environments. There are many kinds of sound scenes in real environments. The sound scene is denoted by sound sources and room acoustics. The number of combination of the sound sources, source positions and rooms is huge in real acoustical environments. However, the sound in the environments can be simulated by convolution of the isolated sound sources and impulse responses. As an isolated sound source, a hundred kinds of non-speech sounds and speech sounds are collected. The impulse responses are collected in various acoustical environments. In this paper, progress of our sound scene database project and application to environment sound recognition are described.
منابع مشابه
Data collection in real acoustical environments for sound scene understanding and hands-free speech recognition
This paper describes a sound scene database necessary for studies such as sound source localization, sound retrieval, sound recognition and hands-free speech recognition in real acoustical environments. This paper reports on a project for collection of the sound scene data supported by Real World Computing Partnership(RWCP). There are many kinds of sound scenes in real environments. The sound s...
متن کاملReal Environment Acoustic Database
Recently importance of hands-free speech communication is increasingly recognized. The sound data for open evaluation is necessary for the studies such as sound source localization, sound retrieval, sound recognition and hands-free speech recognition in real acoustic environments. This paper reports on our project for the acoustic data collection. There are many kinds of sounds in real environm...
متن کاملLocalization of multiple sound sources based on inter-channel correlation using a distributed microphone system
Recently the importance of hands-free speech interfaces is increasingly recognized. However, in real environments, the presence of ambient noises and room reverberations seriously degrades the performance of the hands-free speech recognition. Reliable sound source localization is necessary to maximize the effect of noise reduction. This paper proposes a new method of multiple sound source local...
متن کاملRobot Audition – Hands - Free Automatic Speech Recognition under Highly - Noisy Environemnts – Kazuhiro NAKADAI
This paper addresses robot audition, which realizes listening capabilities for robots using robot-embedded microphones. For robot audition, we propose real-time sound source separation and automatic speech recognition (ASR) techniques for dynamically changing environments based on microphone array processing, which is applicable to hands-free ASR under highly-noisy environments. Implementation ...
متن کاملEnvironmental Sound Source Identifica Model for Robust Speech
In real acoustic environments, humans communicate with each other through speech by focusing on the target speech among environmental sounds. We can easily identify the target sound from other environmental sounds. For hands-free speech recognition, the identification of the target speech from environmental sounds is imperative. This mechanism may also be important for a selfmoving robot to sen...
متن کامل